CDS

Accession Number TCMCG074C03620
gbkey CDS
Protein Id KAF8378516.1
Location join(11202957..11203433,11203574..11203757,11204316..11204453,11205757..11205927,11206014..11206117,11206577..11206644,11210840..11211006,11213439..11213508,11215538..11215692,11218030..11218135,11218408..11218495,11218576..11218719,11219558..11219788)
Organism Tetracentron sinense
locus_tag HHK36_029859

Protein

Length 700aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000023.1
Definition hypothetical protein HHK36_029859 [Tetracentron sinense]
Locus_tag HHK36_029859

EGGNOG-MAPPER Annotation

COG_category OU
Description protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K04773        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCGAAACTTCTCTGTACTTCTCATTTGAACCCGTTAGGTCGCCGGAGATTTTCTGCAATTTTATCCAAATCTCCGGTGCCGATTCAACTCAGTCTCTCACCGTCTCGTTTCCCCTTCGAATTCCACTTCTCTCTTCGTAATCACTTCCCAAATCGTCAAATTCATCTTCGATTTCTTCGTAATTCAGTTCATCAAAGGGGTTTATCAGTTCGAGCCCTAGAATCGTCTTCTGAAACTAAGGATGAGGAGGTTTCGAAGAAGGAAGACGAATCCTCTTCTTCTGAAACCGAAACTAGGTCTGTGGATGTTAATGGAAGTTTGGTAGGATCAAATGATGATTATCCAAGCGGTGATTTTGAGTTCAAGGAGATTAATGGATGGATGAGCTTTGTCGTGAAGCTTCGAATGCTCTTTGCATTCCCATGGGAGCGTGTTCGGAAGGGAAGCGTACTTTCAATGAAGCTTCGAGGCCAGATATCTGAACAGCTAAAGAGCCGTTTCTCTTCAGGACTATCTCTGCCTCAAATTTGCGAAAATTTCTCAAAAGCAGCATACGATCCTCGTATCTCTGGTGTCTATCTTCAGATAGAACCCCTGAGCTGTGGGTGGGGCAAAGTTGAAGAGATACGGAGGCATATACTAAATTTCAGGAAGTCAGGTAAATTCATTGTGGGCTATGTCTCAGTTTGTGGGGAGAAAGAGTATTACCTTGGTTGTGCTTGTGAAGAGCTATATGCTCCTCCTAGTGCTTATTTTGCTTTGTATGGTTTGACGGTTCAAGCATCATTTCTCGGGGGTGTTCTTGAAAAAGTGGGAATTGAACCACAAGTGCAAAGAATTGGTAAATATAAAAGCGCCGGGGATCAACTCATGCGCAAAAACATGTCAAAAGAAAATTGTGAGATGCTGACTGCATTGCTAGATAGCATCTACGGAAACTGGCTTGATAAAGTTTCTTCTACTAAAGGGAAGAGAAGAGAAGAAATTGAGAATTTCATTGATGAAGGAGTTTATCAGATTGAAAGGCTGAAAGAAGGAGGCTGGATAACAAATATCCATTACGATGACGAGGTTATCTCAATGTTGAAAGAGAGATTGGGATTGAAGAAGGAGAAAAATCTTCCAATGGTTGATTACAGGAAATACTCTAGAGTCAGAAACTGGACTCTTGGTTTATCTGGAGGAAAAGACCAAATAGCTGTGATCAGAGCCTCTGGAAGCATTAGTCGTGTTCGTGGTCCATTTAGTGTATCTAATTCGGGTATTATTAGTGAGCAGTTCATTGAGAAGATTCGTAGTGTAAGAGAGTCAAAAAGATACAAGGCTGCTATCATACGAATTGACAGCCCTGGAGGTGATGCTCTTGCTTCTGATTTAATGTGGAGGGAAATTGGACTTCTGGCTGCCTCAAAACCTGTCGTTGCATCAATGTCTGATGTGGCTGCTAGTGGAGGGTACTACATGGCAATGGCAGCAGGGACTATTGTTGCAGAGAATCTTACCTTAACAGGTTCAATTGGAGTTGTTACAGGAAAGTTTAATTTGGGCGAACTATATGAAAGGATTGGCTTCAATAAGGAGATTATATCAAGGGGAAAATATGCTGAGCTCACTGCTGCTGAACAACGTCCTTTCAGGCCAGATGAAGCAGAACTCTTTGCTAAATCTGCTCAGAATGCATATAAACAATTCCGAGACAAAGCAGCCCTTTCAAGATCAATGACTGTAGAGCAGATGGAGGAAATTGCTCAAGGAAGGGTATGGGCTGGTAAAGATGCAGCTTCCCGAGGTTTGGTTGATGCCATTGGTGGGCTCTCTCGAGCTGTTGCAATTGCAAAACAGAAGGCCAACATACCCCAAGACAAACAGGTTAAACTCGTTGAGATGTCAAGACCATCACCCACTGTGCCAGAGATCTTAACTGGTATAGGGAGTTCCCTTGTTGGAGTGGACAGGACTTTGAAGGAGCTCTTGCAAGACTTGACATTTTCTGATGGAGTCCAAGCCAGAATGGATGGAATCATGTTTCAGAGATTGGAGGGAGCTTCTTTTGCCAACCCCATATTTACTTTAATAAAGGACTACCTAAGTTCCCTTTGA
Protein:  
MSKLLCTSHLNPLGRRRFSAILSKSPVPIQLSLSPSRFPFEFHFSLRNHFPNRQIHLRFLRNSVHQRGLSVRALESSSETKDEEVSKKEDESSSSETETRSVDVNGSLVGSNDDYPSGDFEFKEINGWMSFVVKLRMLFAFPWERVRKGSVLSMKLRGQISEQLKSRFSSGLSLPQICENFSKAAYDPRISGVYLQIEPLSCGWGKVEEIRRHILNFRKSGKFIVGYVSVCGEKEYYLGCACEELYAPPSAYFALYGLTVQASFLGGVLEKVGIEPQVQRIGKYKSAGDQLMRKNMSKENCEMLTALLDSIYGNWLDKVSSTKGKRREEIENFIDEGVYQIERLKEGGWITNIHYDDEVISMLKERLGLKKEKNLPMVDYRKYSRVRNWTLGLSGGKDQIAVIRASGSISRVRGPFSVSNSGIISEQFIEKIRSVRESKRYKAAIIRIDSPGGDALASDLMWREIGLLAASKPVVASMSDVAASGGYYMAMAAGTIVAENLTLTGSIGVVTGKFNLGELYERIGFNKEIISRGKYAELTAAEQRPFRPDEAELFAKSAQNAYKQFRDKAALSRSMTVEQMEEIAQGRVWAGKDAASRGLVDAIGGLSRAVAIAKQKANIPQDKQVKLVEMSRPSPTVPEILTGIGSSLVGVDRTLKELLQDLTFSDGVQARMDGIMFQRLEGASFANPIFTLIKDYLSSL